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Novel calpains and the use thereof 

The invention relates to a novel mcunmalian calpain CAPNll^ to its 
5 synthesis and to its use. 

calpains are a superfamily of related proteins, some of which 
have been demonstrated to act as calcium-dependent cysteine 
proteases. Eight different calpains have been identified in 
10 mammals. 

calpains form a family t>± intracellular calcium-dependent 
cysteine proteases. An increasing number of mconmalian calpain 
homologs has been identified, and the individual members can be 

15 divided into four classes on the basis of the physical structure 
and the predicted properties. Class A, the ^'classical calpains" 
CAPNl, CAPN2, CAPN3 (p94); CAPN8 (nCL-2) and CAPN9 (nCL-4) 
probably all have protease activity and are Ca2+-dependent- They 
consist of a subunit of variable size (80 kDa) and an invariant 

20 small subunit (30 kDa). Class B and D calpains, CAPN5 (6, 15) and 
CAPN7 (8) have protease activity but are very probably 
Ca2+-independent ; the class C calpain, CAPN6, probably has no 
protease activity. The calpains can also be divided into 
categories on the basis of their expression patterns, with CAPN3, 

25 CAPN6, CAPN8 and CAPN9 showing some tissue specificity. The 
function of the calpains is unknown although they have been 
connected with a large number of physiological processes and 
pathological states (review in reference 17). To elucidate their 
function and evolutionary history it is necessary to identify the 

30 entire spectrum of calpain family members. 

The invention relates to the novel polypeptide CAPNll having the 
amino acid sequence disclosed in SEQ. ID No. 2. 

35 The novel calpain protein CAPNll has the properties typical of 
calpains including potential protease and calcium-binding 
domains . It displays a greatly restricted tissue distribution and 
is expressed mainly in the testis. With the aid of radiation 
hybrid mapping, the gene has been localized to chromosome 6 in a 

40 region which is assigned to pl2. Phylogenetic analysis reveals 
that CAPNll in mammals is most closely related to CAPNl and 
CAPN2 . 

However, the predicted CAPNll sequence has the greatest homology 
45 with the chicken calpain ^t/m among the available calpain 

sequences. Thus, CAPNll may be the human ortholog of |x/m calpain. 
The discovery of this novel calpain emphasizes the complexity of 
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the calpain family, the members of which can be distinguished on 
the basis of the protease activity r calciiim dependence and tissue 
expression. 

5 The cDNA nucleotide sequence of the CAPNll gene contains 2338 
nucleotides (SEQ ID No. 1). The cDNA sequence is derived from a 
single mRNA by successful amplification of the complete presumed 
coding region from human testis cDNA by means of flanking 
primers. Several cDNA clones have been completely sequenced in 
10 order to exclude any PGR artefacts. 

There is a large open reading frame which encodes a protein 
having 702 amino acids (Mr 80 kDa) (Fig. 1). The amino acid 
sequence resembles the large subunit of members of the calpain 

15 family (Fig. 1). The. protein can be divided into the four 

calpain-typical domains. Domain II displays the properties of a 
protease domain, and the predicted amino acid sequence has the 
three amino acid residues (Cysl02, His259 and Asn283) which are 
part of the active site of cysteine proteases (2). The amino acid 

20 sequence of all five Ca2+-binding sequences described for CAPN2 
(4, 12) are conserved to a certain extent (Fig. 1). This protein 
thus probably has protease and calcium-binding properties. 
Comparison of the predicted amino acid sequence with those of all 
other calpains revealed the greatest sequence homology (57.5%) 

25 with the chicken calpain jx/m. In mammalian calpains, the greatest 
similarity was with h\aman CAPNl (54.3% homology). The least 
similar human calpain, with only 18.7% homology, was CAPN6 . The 
gene coresponding to this cDNA has been designated CAPNll by the 
Human Gene Nomenclature Committee. 

30 

The complete amino acid sequence of all identified human calpains 
has undergone phylogenetic analysis. The results make it possible 
to classify human calpains into four main evolutionary groups 
(Fig. 2). The first group is represented by CAPN5, CAPN6, CAPN7 
35 and CAPN8, the second by CAPNl and CAPN2, the third group by 
CAPN3 and CAPN9, and the fourth comprises CAPNll. The 
phylogenetic analysis thus suggests that CAPNll represents a 
separate calpain subfamily. 

40 The expression of CAPNll in human tissues was investigated by 
Northern and RNA dot-blot analyses. Of the 50 tissue RNAs 
investigated, the CAPNll mRNA was expressed most strongly in the 
testis (Fig. 3A) . The specificity of this signal was confirmed by 
Northern blot analysis and corresponded to an mRNA about 3 kb in 

45 size (Fig. 3D). Much weaker signals were detected in the thymus 
and in the meuranary gland. The significance of this finding is, 
however, unclear because further investigation of thymus RNA by 
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Northern blot analysis produced no signal, despite long exposure 
times (Fig. 3D and data not shown) . A possible explanation would 
be that this weak signal is attributable to cross-hybridization 
with related mRNAs. The testis is thus the main expression site 
5 of CAPNll, although we are unable to preclude the possibility 
that the gene is expressed in other tissues which were not 
investigated . 

We have determined on which chromosome the human CAPNll gene is 
10 located. We have assigned by means of PGR with primers which are 
specific for the human CAPNll nucleotide sequence, and with a 
somatic human/rodent cell hybrid mapping panel, the gene to 
chromosome 6 (Correll Cell Repositories). With the aid of 
radiation hybrid mapping using the medium-resolution Stanford 
15 G3 panel (Research Genetics Inc.) and the database at the 

Stanford Human Genome Center (shgc-www.stanford.edu), the gene 
was located 5 cent ir ay away from the marker SHGC-32834 on this 
chromosome (LOD score 12.87). This marker is located in the 
interval between the microsatellite markers D6S1616 (59.6 cM) and 
20 D6S427 (73.9 cM) (7), and a marker in this interval, D6S269, has 
been cytogenetically assigned to 6pl2 (5). Thus, CAPNll is 
located on chromosome 6 in the vicinity of pl2. No other calpain 
gene has been located on this chromosome. 

25 The chicken \i/m calpain was the first member of the calpain 
family to be cloned (16). It was originally referred to as 
m calpain, but was reclassified after identification of other 
chicken calpains which are very probably orthologous to the 
mammalian \i and m calpains (18). However, no mammalian \i/m 

30 calpain has yet been determined. Since, however, CAPNll displays 
greater homology with chicken \i/m calpain than with other 
mammalian calpains, it may be the ortholog thereof. 

There are to date 5 calpains displaying a certain degree of 
35 tissue specificity - CAPN3 (skeletal muscle), CAPN6 (placenta), 
CAPN8 (possibly smooth muscle), CAPN9 (stomach and small 
intestine) and CAPNll (testis). Numerous proteases have been 
identified in the testis, and it is assumed that they are 
involved in processes such as tissue reorganization (20), 
40 regulation of spermatogenesis (14), penetration of the Zona 
pellucida by sperm (10) and fertility (13). However, many of 
these activities are dependent on secreted proteases, and CAPNll 
probably has an intracellular localization. In the testis, it 
might be involved in processes in which calpains in other tissues 
45 are involved, such as germ cell apoptosis (3) or regulation of 
testis-specif ic transcription factors . 
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A further aspect of this invention relates to the use of the 
polypeptide CAPNll for identifying substances which are able to 
inhibit the enzymatic activity of this polypeptide, called 
calpain inhibitors, in particular those calpain inhibitors which 
5 are selective for CAPNll. Selectivity means that such calpain 
inhibitors inhibit the activity of CAPNll more than the activity 
of the other calpains mentioned above, in particular prefercibly 
at least 10 times, and more preferably 25 times, more. The 
enzymic activity of CAPNll is a Ca-dependent protease activity. 

10 

A further aspect of the invention relates to a method for 
identifying compounds which inhibit the enzymic activity of a • 
polypeptide as claimed in claim 1, comprising: 

15 (a) comparing the extent of the enzymatic activity of CAPNll in 

the presence of the compound with the extent of the enzymatic 
activity of CAPNll in the absence of the compound and 

(b) selecting compounds which alter the extent of the enzymatic 
20 activity of CAPNll compared with the enzymatic activity of 

CAPNll in the absence of the compound. 

The inhibiting compounds identified by the method mentioned above 
are suitable for the treatment of disorders associated with or 
25 linked to a nonphysiologically elevated CAPNll activity, such as 
infertility in men. 

The dosage and the treatment regimen for these inhibitors must be 
determined by routine methods known for other protease 
30 inhibitors • 
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FIGURES 



FIG. 1. Grouping of the predicted amino acid sequence of CAPNll 

20 with other human calpains . The multiple grouping of the amino 
acid sequences was carried out using CLUSTT^ W (19). The 
presumptive starting methionine for CAPNll (GGAatgG) corresponds 
to the minimal consensus sequence for the translation starting 
site (RNNatgG, where R is a purine, Ref. 11)* Amino acids in the 

25 other proteins which are identical to those of CAPNll are shaded. 
Dashes indicate gaps which have been introduced to maximize the 
alignment. Arrowheads point to the three conserved amino acids 
which are part of the active site of the calpains. The potential 
EF-hand calcium-binding domains of CAPN8 are underlined and 

30 numbered consecutively. The arbitrary domains of calpain are 
indicated. The sequences for CAPN4 and CAPN7, which have been 
identified only in rat and mouse respectively , are not shown. The 
published CAPN6 sequence has not been included because it is only 
a partial sequence. The alternative names and accession numbers 

35 for the grouped calpains are indicated in the legend for Fig. 2. 

FIG. 2. Rootless phylogenetic tree of the family of the large 
subunit of human calpains. The analysis was caxried out with the 
PAUP program, and the tree was arranged using CLUSTREE from the 

40 HUSAR server of the Deutsche Krebsf orschungszentrums , Heidelberg 
(www.dkfz-heidelberg.de) . The lengths of the horizontal lines are 
proportional to the derived phylogenetic distances; the vertical 
lines have no significance. 1000 bootstrapping repetitions were 
carried out, and the values are shown at the internal nodes. The 

45 CAPN7 sequence is derived from the mouse because only little of 
the human nucleotide and protein sequence is available. The human 
ortholog does in fact exist (see Ref. 8) so that the use of the 
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mouse sequence for this comparison is justified. The partial 
hximan CAPN8 sequence is the predicted translation of the EST 
clone AA026030 (Hillier et al., 1995, The WashU-Merck 
EST project, unpublished results). An amino acid translation of 
5 this clone shows great similarity with the rat CAPN8 sequence. No 
bootstrapping was carried out with this sequence because it is 
much shorter than the others. Thus no sensible comparison of the 
bootstrapping value with the values from comparisons of 
full-length sequences is possible. The nomenclature specified by 

10 the Human Gene Nomenclature Committee is used. The previous names 
for the various calpains are: CAPNl - m calpain, CAPN2 - 
m calpain, CAPN3 - p94, nCl-1, CAPN8, nCL-2, CAPN9, nCL-4 . The 
EMBL accession numbers, for the calpain sequences used are: CAPNl 
(P17655), CAPN2 (P07384), CAPN3 (P20807), CAPN5 (Y10656), CAPN6 

15 (Y12582), CAPN7 (AJ012475) andCAPN9 (AF022799). 

FIG. 3. Expression of CAPNll. A 32p_iabeled DNA probe with an 800 
base pair segment of the coding sequence of the human CAPNll cDNA 
was hybridized on a master blot (A) , a nylon filter with 

20 dot -blots of RNAs from 50 different human tissues or a Clontech 
multiple tissue Northern blot (D). The filters were washed with 
high stringency ( 6x SSC, 65**C) . The exact site of the various RNAs 
on the dot-blot filter is shown diagrammatical ly (C). The RNAs on 
the Northern blot are indicated over the corresponding lanes. 

25 Dot -blot and Northern blot were rehybridized with DNA probes for 
human ubiquitin (B) and P-actin in order to determine the cunounts 
of poly(A+) RNA loaded. The sites of the size markers (in 
kilobases) are indicated for the Northern blot. The exposure 
times were: A, 72 h; 24 h; D, 48 h, PBL « peripheral blood 

30 leukocytes. 
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